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atg ggt gcg aga gcg tea gta tta age ggg gga gaa tta gat cga tgg gaa aaa act egg 

M1 

ttA cca ccc C q» aag aaa aaa tat aaa tta aaa cat ata gta tog cci ace c . ff 

4%$ C Q C CC 5 C C M * ? 

eta gaa cga ttc gca gtt aat ccx ggc cxg tta gaa aca tea gaa ggc tgt aga caa at* 
su M2 

ctg gga cag eta caa cca tec ctt etc aca ca« te a gaa gaa ctt aca tea tta tat 
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Identification of INS regions within the 
env mRNA using the p19 vector. 
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COMPLETE NUCLEOTIDE SEQUENCE OF p37M-1-10D 

AND 

AMINO ACID SEQUENCE OF p37 gag PROTEIN (SEQ ID NO:129) 
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1 TGGAAGGGCT AATTTGGTCC CAAAAAAGAC AAGAGATCCT TGATCTGTGG ATCTACCACA CACAAGGCTA 

71 CTTCCCTGAT TGGCAGAACT ACACACCAGG GCCAGGGATC AGATATCCAC TGACCTTTGC ATGGTGCTTC 

1<1 AAGTTAGTAC CAGTTGAACC AGAGCAAGTA GAAGAGGCCA AATAAGGAGA GAAGAACAGC TTQTTACACC 

211 CTATGAGCCA GCATGGGATG GAGGACCCGG AGGGAGAAGT ATTAGTGTGG AAGTTTGACA GCCTCCTAGC 

281 ATTTCGTCAC AXGGCCCGAG AGCTCCATCC GGAGTACTAC AAAGACTGCT GACATCGAGC TTTCTACAAG 

351 GGACTTTCCG CTGGGGACTT TCCAGGGAGG TGTGGCCTGG GCGGGACTGG GGAGTGGCGA GCCCTCAGAT 

421 GCTACATATA AGCAGCTGCT TTTTGCCTCT ACTGGGICTC TCTGGTTAGA OCAGATCTGA GCCTGGGAGC 

491 TCTCTGGCTA ACTAGGGAAC CCACTGCTTA AGCCTCAATA AAGCTTGCCT TGAGTGCTCA AAGTAGTGTG 

' — > 

561 TGCCCGTCTG TTGTGTGACT CTGGTAACTA GAGATCCCTC AGACCCTXTT AGTCAGTGTG GAAAATCTCT 

631 AGCAGTGGCG CCCGAACAGG GACTJGAAAG CGAAAGTAAA GCCAGAGGAG ATCTCTCGAC GCAGGACTCG 

BssHII (711) 

701 GCTTGCTGAAGCGCGCGTCGACAGAGAGATGGGTGCGAGAGCGTCAGTATTAAGCGGGGGAGAA 

J> Mb t Gl yAl aArgAI aSer Va I LauSer Gl yGI yGI uLauAapArgTrp 

777 GAAAAAATTCGGTTAAGGCCAGGGGGAAAGAAGAAGTACAAGCTAAAGCACATCGTAT^ 
11> Gl uLya 1 1 ©ArgLeuArgP roQ yGI yLyaLyaLyaTy rLyaLauLyaHi a 1 1 aVal TrpAl aSer ArgGJ uLauG 

853 AACGATTCGCAGTTAATCCTGGCCTGTTAGAAACM 
42* I uArgPheAl aVal AanProGI yLeuLauGl uThr Sor Gl uGl yCyaArgGt nl I aleuGJ yGI nLauG! nProSe 

929 CCTTCAGACAGGATCAGAGGAGCTTCGATCACTATAC^^ 
67> r LauGI nThf Gl ySar Gl uGl uLauArgSer LauTyrAsnThr Val At aThr LauTy rCyaVa I Hi aGI nArg 1 1 • 

1005 GAGATOAGGACACCAAGGAAGCTTTAGACAAGATAG 

93» Gl u 1 1 eLysAtpThf Ly ad uAI aLeuAapLy a 1 1 aGI uGl uGl uGl nAanLyaSar Ly aLyaLyaAl aGI nGf nA 

1081 CAGCAGCTC^CACAGGACACAGCAATCAGGTCAGCCAAAATrACCCTATAGTGC^ 
118> I aAl aAl aAspThr Gl yHi aSerAanGI nVal Ser Gl nAanl^rProl I a Val Gl nAanl I aGI nGI yGI nfcbt Va 

1157 ACATCAGGCCATATCACCTAGAACTTTAAATCCATGGGTAAAAGTAGTAGAAGA 

!!► I Hi «G1 nAI al I aSer ProArgThr LauAsnAl aT rpVal LyaVal Val Gl uGl uLyaAl aPhaSar ProGI uVal 

1233 ATACCCATGTTTTCAGCATTATCAGAAGGAGCCACCCCACAGGACCTGAACACGATGra 

31* ' I aProMBt PheSer Al aLauSar Gl uGl yAl aThr Pr oGI nAsplauAanThr ft*t LauAanThr Val Gl yGI yH 

1309 ATCAAGCAGCCATGCAAATGTTAAAAGAGACCATCAATGAGGAAGCTGCAG^^ 

€2> laGlnAlaAlaAteianMatLauLyaGluThr ll aAanGI uGl uAI aAl aGI uTrpAapArgVal Hi a Pro Val HJ 

1385 TGCAGGGCCTAITGCACOIGGCCAGATGAG^ 

87* aAl aGI yProl I aAl aProGl yGI nMat ArgGJ uPro ArgGI ySarAapl I aAl «GI yThr Thr Sar Thr LauGI n 

1461 GAACAAATAGGATGGATGACAAATAATCCACCTATCCCAGTAGGAGAGATCT^ 

Gl uGl nil aGI yTrpMBtTnrAanAanProP rol I aProVal Q yGI ull aTyrLya ArgTrpI I a 1 1 aLauGI yL 

1537 TGAACAAGATCGTGAGGATGTATAGCCCTACCAGCATTCTGGACATAAGA^ 
138> auAanLy a 1 1 a Va I A rgMa t T y rSe r P roThr Sa r 1 1 aLauAapl I aArgQ nO yProLyaGl uProPhaArgAa 
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1613 CTATGTAGACCGGTTCTATAAAACTCTAAGAGCTGAGCAAGCT^ 
163> pTyrValAspArgPheTyrLysThr LeuArgAI sGl uGl nAI aSer Gl nGI uVal LysAsnT rpfctelThr GJ uThr 

1689 TTGTTGGTCCAAAATGCGAACCCAC^TTGTAAGACCATCCTGAAGGCTCTCGGCCCA 
169* LeuLeuValGi nAsnAI aAsnProAspCysLysThr 1 1 cLeuLycAl aLauGI yProAl aAl aThrlauGI uQ uM 



stop (1816) Xbal (1898)1 

1765 TGATGACAGCATGTCAGGGAGTAGGAGGACCCGGCCATAAGGCAAGA G1 1 1 1 CTAGGGATCCACTAGTTCTAGACT 
214> «tM9 tThr Al aCyaGI nGI yVa I Gf yGI yProGl yHi sLysAI aArgVal L«u 



1841 


Apal (1848) 

CGAGGGGGGG CCCGGIACCT 


TTAAGACCAA •TGACTTACAA 


GGCAGCTGTA 


GATCTTAGCC 


AC1111TAAA 


1911 


AGAAAAGGGG 


GGACTGGAAG 


GGCTAATTCA 


CTCCCAAAGA 


AGAGAAGATA 


TCCTTGATCT 


GTGGATCTAC 


1981 


CACACACAAG 


GCTAcrrccc 


TGATTGGCAG 


AACTAGAGAC 


CAGGGGGAGG 


GGTCAGATAT 


CGACTGACCT 


2051 


7TGGATGGTG 


CTACAAGCTA 


GTACCAGTTG 


AGCCAGATAA 


GGTAGAAGAG 


GCCAATAAAG 


GAGAGAACAC 


2121 


CAGCTTGTTA 


CACCCTGTGA 


GCCTGCATGG 


AATGGATGAC 


CCTGAGAGAG 


AAGTGTTAGA 


GTGGAGGTTT 


2191 


GACAGCCGCC 


TAGCATTTCA 


TCACGTGGCC 


CGAGAGCTGC 


ATCCGGAGTA 


CTTCAAGAAC 


TGCTGACATC 


2261 


GAGCTTGCTA 


CAAGGGACTT 


TCCGCTGGGG ACTTTCCAGG 


GAGGCGTGGC 


CTGGGCGGGA 


CTGGGGAGTG 


2331 


GCGAGCCCTC 


AGATGCTGCA 


XATAAGCAGC 


TGCTTTTTGC 


CTGTACTGGG 


TCTCTCTGGT 


XAGACCAGAT 


2401 


CTGAGCCTGG 


GAGCTCTCTG 


GCTAACTAGG 


GAACCCACTG 


CTTAAGCCTC 


AATAAAGCTT 


GCCTTGAGTG 


2471 


CTTCAAGTAG 


TGTGTGCCCG 




GACTCTGGTA 


ACTAGAGATC 


CCTCAGACCC 


TTTTAGTCAG 



2541 TGTGGAAAAT CTCTAGCACC CCCCAGGAGG TAGAGGTTGC AGTGAGCCAA GATCGCGCCA CXGCATTGGA 



2611 GCCTGGGCAA GAAAACAAGA CTGTCTAAAA TAATAATAAT AAGTTAAGGG TATTAAATAT ATTTATACAT 
2681 GGAGGTCATA AAAATATATA TATTTGGGCT GGGCGCAGTG GCTCACACCT GCGCCCGGCC CT7TGGGAGG 
2751 CCGAGGCAGG TGGATCACCT GAGTTTGGGA GTTCCAGACC AGCCTGACCA ACATGGAGAA ACCCCXTCTC 
2821 TGTGTATTTT TAGTAGATTT TATTTTATGT GTATTTTATT CACAGGTATT TCTGGAAAAC TGAAACTGTT 
2891 TTTCCTCXAC TCTGATACCA CAAGAATGAT CAGCACAGAG GAAGACTTCT GTGATCAAAT GTGCTGGGAG 
2961 AGGGAGGTTT TCACCAGCAC ATGAGCAGTC AGTTCTGCCG CAGACTCGGC GGGTGTCCTT CGGTTGAGTT 
3031 CCAACACCGC CTGCCTGGAG AGAGG7CAGA CGAGAGGGTG AGGGCTCAGT CCCCAAGACA TAAACACCCA 
3101 AGACAXAAAC ACCCAACAGG TCCACCCCGC CTGCTGCCCA GGCAGAGCCG ATTCACCAAG ACGGGAATTA 
3171 GGATAGAGAA AGAGTAAGTC ACACAGAGCC GGCTGTGCGG GAGAACGGAG TTCT A TTATG ACTCAAATCA 
3241 GTCTCCCCAA GCATTCGGGG ATCAGAGTTT TTAAGGATAA CTTAGTGTGT AGGGGGCCAG TGAGTTGGAG 
3311 ATGAAAGCGT AGGGAGTCGA AGGTGTCCTT TTGCGCCGAG TCAGTTCCTG GGTGGGGGCC ACAAGATCGG 
3381 ATGAGCCAGT TTATCAATCC GGGGGTGCGA GCTGATCCAT GGAGTGCAGG GTCTGCAAAA TATCTCAAGC 
3451 ACTGATTGAT CT7AGGTTTT ACAATAGTGA TGTTACCCCA GGAACAATXT GGGGAAGGTC AGAATCTTGT 
3521 AGCCTGTAGC TGCATGACTC CXAAAOCATA ArTTCTTTTT IVAUTm TT TTTTXATTTT TGAGACAGGG 

Pstl (3639) 

3591 TCTCACTCTG TCACCXAGGC TGGAGTGCAG TGGTGCAATC ACAGCTCACT GCAGCCCCXA GAGCGGCCGC 
3661 CACCGCGGTG GAGCTCCAAT TCGCCCTATA GTGAGTCGTA TTACAATTCA CTGGCOGTCG TTTTACAACG 
3731 TCGTGACTGG GAAAACCCTG GCGTTACCCA ACTIAAICGC CTTGCAGCAC ATCCCCCTTT CGCCAGCTGG 
3801 CGTAATAGCG AAGAGGCCCG CACCGATCGC CCXTCCGAAC AGTTGOGGAG CCTGAATGGC GAATGGCGOG 
3871 AAATTGTAAA CGTTAATATT TTGTTAAAAT TCGCGTXAAA TTTTTGTTAA ATCAGCTCAT TTTTTAACCA 
3941 ATAG GCCGAA ATCGGGAAAA TCCCTTATAA ATGAAAAGAA XAGACCGAGA TAGGGTTGAG TGTTGITCCA 
4011 GTTTGGAACA AGAGTCGACT ATTAAAGAAC GTGGACTCCA ACGTCAAAGG GCGAAAAACC GICTATCAGG 
4081 GCGATGGCCC ACTACGTGAA CCATCACCCT AATCAAGTTT TTTGG GG TOG AGGTGCCGTA AAGCACTAAA 
4151 TCGGAACCCT AAAGGGAGCC CCCGATTTAG AGCTTGACGG GGAAAGCCGG GGAACGTGGC GAGAAAGGAA 
4221 GGGAAGAAAG CGAAAGGAGC GGGCGCTAGG GCGCTGGCAA GTGTAGCGGT CACGCTGCGC GXAACCAOCA 
4291 CACCCGCCGC GCTTAATGCG CCGCTACAGG GCGCGTCCCA GGTGGCACTT TTCGGGGAAA TGTGCGCGGA 
4361 ACCCCTATTT GTTTATTTTT CTAAAXACAT TCAAAXAXGT ATCCGCTCAT GAGACAAXAA CCCTGATAAA 
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4431 TGCTTCAATA ATATTGAAAA AGGAAGAGTA TGAGTATTCA ACATTTCCCT CTCGCCCTTA TTCCCTTTTT 
4501 TGCGGCATTT TGCCTTCCTG TTTTTGCTCA CCCAGAAACG CTGGTGAAAG TAAAAGATGC TGAAGATCAG 
4571 TTGGGTGCAC GAGTGGGTTA CATCGAACTG GATCTCAACA GCGGTAAGAT CCTTGAGAGT TTTCGCCCCG 
4641 AAGAACGTTT 'TCCAATGATG AGCACTTTTA AAGTTCTGCT ATGTGGCGCG GTATTATCCC GTATTGACGC 
4711 CGGGCAAGAG CAACTCGGTC GCCGCATACA CTATTCTCAG AATGACTTGG TTGAGTACTC ACCAGTCACA 
4781 GAAAAGCATC TTACGGATGG CATGACAGTA AGAGAATTAT GCAGTGCTGC CATAACCATG AGTGATAACA 
4851 CTGCGGCCAA CTTACTTCTG ACAACGATCG GAGGACCGAA GGAGCTAACC GCTTTTTTGC ACAACATGGG 
4921 GGATCATGTA ACTCGCCTTG ATCGTTGGGA ACCGGAGCTG AATGAAGCCA TACCAAACGA CGAGCGTGAC 
4991 ACCACGATGC CTGTAGCAAT GGCAACAACG TTGCGCAAAC TATTAACTGG CGAACTACTT ACTCTAGCTT 
5061 CCCGGCAACA ATTAATAGAC TGGATGGAGG CGGATAAAGT TGCAGGACCA CTTCTGCGCT CGGCCCTTCC 
5131 GGCTGGCTGG TTTATTGCTG ATAAATCTGG AGCCGGTGAG CGTGGGTCTC GCGGTATCAT TGCAGCACTG 
5201 GGGCCAGATG GTAAGCCCTC CCGTATCGTA GTTATCTACA CGACGGGGAG TCAGGCAACT ATGGATGAAC 
5271 GAAATAGACA GATCGCTGAG ATAGGTGCCT CACTGATTAA GCATTGGTAA CTGTCAGACC AAGTTTACTC 
5341 ATATATACTT TAGATTGATT XAAAACTTCA TTTTTAATTT AAAAGGATCT AGGTGAAGAT CCTTTTTGAT 
5411 AATCTCATGA CCAAAATCCC TTAACGTGAG TTTTCGTTCC ACTGAGCGTC AGACCCCGTA GAAAAGATCA 
5481 AAGGATCTTC TTGAGATCCT TTTTTTCTGC GCGTAATCTG CTGCTTGCAA ACAAAAAAAC CACCGCTACC 
5551 AGCGGTGGTT TGTTTGCCGG ATCAAGAGCT ACCAACTCTT TT7CCGAAGG TAACTGGCTT CAGCAGAGCG 
5621 CAGATACCAA ATACTGTCCT TCTAGTCTAG CCGTAGTTAG GCCACCACTT CAAGAACTCT GTAGCACCGC 
5691 CTACATACCT CGCTCTGCTA ATCCTGTXAC CAGTGGCTGC TGCCAGTGGC GATAAGTCGT GTCTTACCGG 
5761 G7TGGACTCA AGACGATAGT TACCGGATAA GGCGCAGCGG TCGGGCTCAA CGGGGGGTTC GTGCACACAG 
5831 CCCAGCTTGG AGCGAACGAC CTACACCGAA CTGAGATACC TACAGOGTGA GCTATGAGAA AGCGCCACGC 
5901 TTCCCGAAGG GAGAAAGGCG GACAGGTATC CGGTAAGCGG CAGGGTCGGA ACAGGAGAGC GCACGAGGGA 
5971 GCTTCCAGGG GGAAACGCCT GGTATCTTTA TAGTCCTGTC GGGTTTCGCC ACCTCTGACT TGAGCGTCGA 
6041 TTTTTGTGAT GCTCGTCAGG GGGGCGGAGC CTATGGAAAA ACGCCAGCAA CGCGGCCTTT TTACGGTTCC 
6111 TGGCCTTTTG CTGGCCTTTT GCTCACATGT TCTTTCCTGC GTTATCCCCT GATTCTGTGG ATAACCGTAT 
6181 TACCGCCTTX GAGTGAGCTG ATACCGCTCG CCGCAGCCGA ACGACCGAGC GCAGCGAGTC AGTGAGCGAG 
6251 GAAGCGGAAG AGCGCCCAAT ACGCAAACCG CCTCTCCCCG CGCGTTGGCC GATTCATTAA TGCAGCTGGC 
6321 ACGACAGGTT TCCCGACTGG AAAGCGGGCA GTGAGCGCAA CGCAATTAAT GTGAGTTAGC TCACTCATTA 
6391 GGCACCCCAG GCTTTACACT TTATGCTTCC GGCTCGTATG TTGTGTGGAA TTGTGAGCGG ATAACAATTT 
6461 CACACAGGAA ACAGCTATGA CCATGATTAC GCCAAGCTCG GAATTAACCC TCACTAAAGG GAACAAAAGC 
Pstl (6533) 

6531 TGCTGCAGGG TCCCTAACTG CCAAGCCCCA CAGTGTGCCC TGAGGCTGCC CCTTCCTTCT AGCGGCTGCC 
6601 CCCACTCGGC TXTGCTTTCC CTAGTTTCAG TTACTTGCGT TCAGCCAAGG TCTGAAACTA GGTGCGCACA 
6671 GAGCGGTAAG ACTGCGAGAG AAAGAGACCA GCTTTACAGG GGGTTXATCA CAGTGCACCC TGACAGTCGT 
6741 CAGCCTCACA GGGGGTTTAT CACATTGCAC CCTGACAGTC GTCAGOCTCA CAGGGGGTTT ATCACAGTGC 
6811 ACCCTTACAA TCATTCCATT TGATTCACAA TTTTTTTAGT CTCTACTGTG CCTAACTTGT AAGTTAAATT 
6881 TGATCAGAGG TGTGTTCCCA GAGGGGAAAA CAGTAXATAC AGGGTTCAGT ACTATCGCAT TTCAGGCCTC 
6951 CACCTGGGTC TTGGAATGTG TCCCCCGAGG GGTGATGACT ACCTCAGTTG GATCTCCACA GGTCACAGTG 
7021 ACACAAGATA ACCAAGACAC CTCCCAAGGC TACCACAATG GGCCGCCCTC CACGTGCACA TGGCCGGAGG 
7091 AACTGCCATG TCGGAGGTGC AAGCACACCT GCGCATCAGA GTCCTTGGTG TGGAGGGAGG GACCAGCGCA 
7161 GCTTCCAGCC ATCCACCTGA TGAACAGAAC CTAGGGAAAG CCCCAGTTCT ACTTACACCA GGAAAGGC 
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